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O Abstract 

CN S'-adenosylhomocysteine hydrolase (SAHH), a ceUular enzyme that plays a key role in methylation 

^ ' reactions including those required for maturation of viral mRNA, is an important drug target in the 
discovery of antiviral agents. While targeting the active site is a straightforward strategy of enzyme 
_ 2 inhibition, evidences of allosteric modulation of active site in many enzymes underscore the molecular 
origin of signal transduction. Information of co-evolving sequences in SAHH family and the key residues 
for functional dynamics that can be identified using native topology of the enzyme provide glimpses 



^ into how the allosteric signaling network, dispersed over the molecular structure, coordinates intra- and 

o 

OO inter-subunit conformational dynamics. To study the link between the allosteric communication and 

o^ 

functional dynamics of SAHHs, we performed Brownian dynamics simulations by building a coarse- 

O 

I grained model based on the holo and ligand-bound structures. The simulations of ligand-induced 
^— I transition revealed that the signal of intra-subunit closure dynamics is transmitted to form inter- 
• ^ subunit contacts, which in turn invoke a precise alignment of active site, followed by the dimer-dimer 

X 

rotation that compacts the whole tetrameric structure. Further analyses of SAHH dynamics associated 
with ligand binding provided evidence of both induced fit and population shift mechanisms, and also 
showed that the transition state ensemble is akin to the ligand-bound state. Besides the formation of 
enzyme-ligand contacts at the active site, the allosteric couplings from the residues distal to the active 
site is vital to the enzymatic function. 
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I. INTRODUCTION 



S'-adenosylhomocysteine (SAH) hydrolase catalyzes the hydrolytic cleavage of SAH to adeno- 
sine and L-homocysteine. Inhibition of this enzyme causes the accumulation of SAH, and con- 
sequently suppresses S'-adenosyl-L-methionine dependent transmethylation via a feedback inhi- 
bition mechanism. Since the methylation at the 5 '-terminus of mRNA is crucial for the viral 
replication, SAHH is a promising target for the discovery of broad spectrum antiviral agents pLj . 

Developing therapeutic agents that can directly bind and regulate the active site of a biological 
target has been a dominant pharmacological strategy j2l [3]. For SAHH, various adenosine 
analogues, including carbocyclic adenosine, neplanocin A, 3-deazaneplanocin A p], and fluoro- 
neplanocin A (F-NpcA), are the recently developed inhibitors that directly target the active site 
[1] . However, conformational flexibility of enzyme structures gleaned in x-ray, NMR experiments 
and the presence of allosteric site revealed in mutational studies highlight the allosteric couplings 
of residues distal to the active site as an another important principle in drug design strategy 
[5H7] . Although the formation of specific enzyme-substrate contacts in the active site is required 
for the catalytic activity, allosteric orchestration among residues, dispersed over the molecular 
architecture, is also essential to regulate conformational fluctuations, so as to assist a precise 
positioning of catalytic elements in the active site [8j. SAHH, an enzyme consisting of chemically 
identical four subunits, each of which undergoes the open-to-closed (O— t-C) transition in response 
to substrate binding, is an interesting system to study the link between enzymatic function and 
allosteric dynamics beyond monomeric enzyme PMI5]. 

Structure, dynamics and catalytic function are the three main themes in understanding en- 
zymes [Ul [TBVEU] ■ For the past decades, much effort has been made to decode the link between 
the allosteric signaling of enzymes, conformational dynamics, and their function by using both 
theories [IDl [lH EIHSl] and experiments [131 EOl I25ti27] , and has recently been extended to 
the studies of molecular motors [281 - I30] . To gain microscopic understanding to the allostery in 
SAHH and its implication to the catalysis, we employed multifaceted computational approaches: 
(i) Statistical coupling analysis (SCA) |211 [31], |32] was used to reveal networks of co-evolving 
amino acid residues in the SAHH family, (ii) Structural perturbation method (SPM) adapting 
normal mode analysis (NMA) identified the network of hot spot residues associated with the 
functional motion of SAHH [23]. (iii) While the SCA and SPM provide hints as to the corre- 
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lation between allosteric signaling network and conformational dynamics, it is difficult to gain 
from these two static analyses further insights into a large scale conformational change, such 
as a shifting of statistical ensemble and heterogeneity in the transition routes. To this end, we 
performed Brownian dynamics (BD) simulations of the O— t-C transition of tetrameric SAHH in 
response to ligand binding by using a structure-based coarse-grained model [TUl [T21 EOl ESHSS] . 

Among the various stages of the SAHH enzymatic cycle that involve multiple catalytic pro- 
cesses [Ml ET], which occur on time scales of ~ (10 — 100) ms, main focus of our simulation is on 
the fast (~ (1 — 100) fis) conformational dynamics of SAHH before, after as well as in the pro- 
cess of substrate binding. Despite the large time scale gap between catalysis and conformational 
dynamics associated with ligand binding, it is important to understand the conformational fluc- 
tuations and dynamics since they lie at the core of allosteric regulation of catalytic power in 
enzymes. In this paper, by analyzing the dynamics resulting from our simulation, we highlight 
the link between the functional dynamics and allosteric signaling implicated in the SCA and 
SPM. 

II. METHODS 

SAHH structure: A subunit structure, consisting of 432 residues, retains catalytic, 
cofactor-binding and C-terminal domains. An alpha helix (aAl) and loops (residues 182-196, 
352-354) join the catalytic domain with the cofactor-binding domain, forming a hinge region 
(Fig(T}\ left, see also Figure SI). The interaction between C-terminal domain of a subunit and 
cofactor-binding domain of the partner subunit (FigjljA right) is employed to assemble two 
dimer complexes (AB and CD) into the tetramer. A negatively charged central core channel 
constitutes the interface between A and B (or C and D) (Fig{T|3). A further assembly between 
the two dimers (AB and CD), whose interface exhibits electrostatic complimentarity, forges an 
oblate-like tetrameric structure (FigjljB). Depending on the presence of ligand in the binding 
cleft between catalytic and cofactor-binding domains, SAHHs select either open or closed 
conformation whose root-mean-square deviation (RMSD, Aqc) is 4.2 A. The difference of the 
two subunit structures can also be quantified using an angle, a, defined among W112, K186 
and W310, which is a = 70° for the open and a = 50° for the closed structures (FigjljA.). In 
addition, an angle /3 measures the relative orientation between AB and CD dimers; /3 = 3° for 
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the open and (3 = 15° for the closed structures (Fig IT 



Statistical coupling analysis (SCA) on SAHH family: If two amino acid residues in a 
protein are functionally or energetically coupled, a mutation at one site is expected to influence 
the other sites to level off the energetic perturbation, so that the protein can retain its functional 
state. While double or triple mutant experiment is laborious to prove this proposition, evolu- 
tionary data imprinted in multiple sequence alignment (MSA) of a protein family still allow one 
to infer the coupling between two distal sites. Statistical coupling analysis (SCA) is a method 
to identify a set of co-evolving amino acid sites by performing mutation "experiment" on the 
MSA of a protein family [HI |3ll [32] . 

The whole sequence space of SAHH family, which includes 1156 sequences over the entire 
organisms, was clustered by their sequence identities, and 252 sequences under 75% identities 
were finally selected. This diverse sequences of SAHH family were used to produce MSA using 
the multiple sequence comparison by log-expectation (MUSCLE) program [38]. Each of the 
sequences has 1,242 positions including gaps in the full MSA. Using multiple sequence align- 
ment (MSA) of SAHH family, we calculated the statistical free energy like function defined as 
AGi/ksT* = X]a=i [Pi log {vt /Pa)f where is the number of types of amino acid at posi- 
tion i along the sequence, a denotes amino acid species, p° is the frequency of an amino acid a 
at position z, and pa is the frequency of an amino acid a in the full MSA. AGj (Figj2]A.) identi- 
fies highly conserved sequence positions in ligand- and cofactor-binding site and the C-terminal 
domain that constitutes a part of the cofactor-binding site (Fig|2]B). 

Further analysis of the MSA enables us to identify the set of mutually correlated residues in 
SAHH from an evolutionary perspective. The statistical coupling analysis (SCA) jUl EH 
calculates AAG^j/kBT*{= ^ ^ E^ii [Pi,R iPi,RlVc) - vt log {pVVa)] ^ where is the 
frequency of amino acid in a restricted subalignment with respect to the position j), a score 
matrix that quantifies the effect of sequence "perturbation" at position j on position i by 
restricting the full MSA into a subalignment with the most conserved sequence at the j-th 
position and quantifying the sequence variation at the z-th position of subalignment with respect 
to the same position of the full MSA. Since the choice of appropriate size for subalignments 
(/) is critical in obtaining statistically significant correlations between residues in a protein 
family, the smallest value of / should be chosen to satisfy the central limit theorem |32]. In 
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order to determine the suitable / value, we build subalignments from the MSA with different 
/ (< 1) values by randomly choosing /Nmsa sequences, where Nmsa is the size of the full 
MSA. For a given /, we generated 1000 subalignments and computed (AG*!) / = Yl!k=i 
and 0"! = (AG|)2 — AGf , where AG* means the average AGj values in a randomly chosen 
subalignment. By analyzing the distribution of these values, we selected / = 0.5 to statistically 
significant subalignment size (Figure S2A), and 213 positions were allowed to be perturbed in 
this condition. In the AAGij/ksT* scoring matrix, AAGij with a high value signifies that the 
i-th position is susceptible to the perturbation at the j-th position. After filtering out the noise, 
the allosteric signaling network can be obtained by adopting the coupled two-way clustering 
algorithm to cluster the position and perturbation elements of AAGij exhibiting a high signal 

Elastic network model (ENM) and structural perturbation method (SPM): In ENM, 
a protein structure is represented as a network of beads connected with mechanical springs of 
uniform force constant ko- The potential energy for ENM given as Eenm = ^oi^ij ~ 

r°jY<d{r°j — Rc), where rij is the distance between residues i and j, and 9(. . .) is the Heaviside 
step function, connects two coarse-grained center (G^ atoms) if the distance r°. in the native 
structure is within a cutoff distance Rc- By calculating the Hessian matrix of Eenm one can 
obtain normal modes {um, M>7, 1<M<6 are associated with translational and rotational 
degrees of freedom) of a protein architecture. Given the open (ro) and closed structures (rc) 
of SAHH, the overlap between um and the O— )-C conformational change (ro^c) is assessed by 
calculating cos {um ■ ro^c) (Figjs]). 

The SPM assesses the importance of a residue in the elastic network by locally perturbing the 
residue and probing its response. The perturbation is realized by changing the force constant 
of the springs that link the residue and its neighbors. The response to the perturbation in 
the frequency of mode M, which overlaps the most with the O— t-C structural transition, is 
calculated using 6u{M,n) = z7j^ ■ 6H ■ um where 5H is the Hessian matrix of the following 
perturbed energy potential: 6Eenm = ^^oi^ij ~ ~ Rc)- The higher 5u{M^i) 

signifies an importance of i-th residue with respect to the mode M. 

Energy potential and simulations: To define the energy potential of SAHH, we used an 
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x-ray crystal structure of human SAHH structure (PDB id: 3NJ4) that contains F-NpcA as 
the closed structure and a homology model based on rat SAHH as the open (or holo) structure 
[1] . The open structure of SAHH should be stable in the absence of ligand, but the formation 
of the inter-residue contacts incorporated from the closed structure should bring SAHH from 
the open to closed forms in the presence of SAHH-ligand interactions, by further stabilizing 
the ligand-bound SAHH structure [12]. To implement this dynamical behavior, we used the 
self-organized polymer (SOP) energy potential [IDl [El IS]: Htot = i/SAHH ^ ^Ug ^ ^sAHH-iig 
where i/SAHH ^ ^^^^ ^ ifSAHH(,) + E.>, ^fnte?^(^,j) and i/SAHH ^ jj^^^^ + ^Q^gig^g 
of the terms involving intra-subunit connectivity using finite extensible nonlinear elastic poten- 
tial (-^fene) and non-bonded potential term (i/nb) that stabilizes either one of open or closed 
monomer structure depending on the presence of ligand at its binding cleft. The energy contri- 
butions from the ifnb are the sum of open native contacts (hI^^) and pure closed native contacts 
^icnon [12]. The enzyme-ligand interaction (ijSAHH iig-j gj^ould contribute to the SAHH com- 
paction. 

We simulated the SAHH conformational dynamics under an overdamped condition by inte- 
grating the Brownian dynamics algorithm [121 IH] . The ligand binding dynamics were simulated 
by releasing four ligands at 20 A away from each binding cleft under a periodic and weak 
harmonic constraint (/c=0.035 pN/nm) imposed every 25 fis, which was used to ensure the lig- 
ands to bind the binding cleft within our simulation time. The initial distance of the ligands 
from each binding pocket provides enough time to randomize the orientation of the ligand be- 
fore it reaches the binding cleft (see also Movies SI and S2). While the form of SOP potential 
and simulation strategy described above are essentially the same as those used in the previous 
study on protein kinase A [12], the additional term that takes into account the inter-subunit 
interaction Hf^^^ substantially enriches the resulting dynamics. 

III. RESULTS AND DISCUSSION 

Allosteric signaling network inferred from co-evolving residues: Employing 
SCA to the MSA of SAHH family, we identified two co-evolving clusters of residues (Fig|2p 
and Table 1 for the list). It is noteworthy that since SAHHs function as a tetramer, the 
co-evolving clusters revealed from monomer sequence are indeed the consequence of allosteric 
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communication between residues spread over the entire tetrameric architecture. Most of 
co-evolving residues identified by the SCA are not well-conserved over the SAHH family except 
for the residues in C-terminal domain and C79 that is in direct contact with the ligand (Figure 
S2C). (i) The cluster 1 shows that a number of residues in green clusters around the active 
site are susceptible to the perturbation of residues in yellow clusters. Given that the active 
site is the core region for SAHH function, it is not surprising that the residues susceptible to 
perturbations are distributed around the binding cleft. Moreover, the region of central core 
channel is also detected to be a part of the main allosteric signaling network, which includes 
C195, R196, and E197. In particular, C195 is known to be a key residue for the catalysis and 
the maintenance of the reduction potential at the 3'-position |12]. (ii) The cluster 2 includes 
two spatially disconnected sets of residues in the C-terminal domain (from F425 to Y432) and 
the residues (G122, P123, D125) in the catalytic domain that are located away from the active 
site. A mutation in "cyan" clusters perturbs the residues in "red" clusters. According to the 
mutation studies, K426 and Y430 in this region are the critical residues that affect the cofactor 
affinity and/or the assembly of tetrameric structure [ISj HI]. Furthermore, although it was not 
detected in the SCA, a mutation of R49 in direct contact with D125 that belongs to the cluster 
2, is reported to dramatically reduce SAHH activity 

Network of hot spot residues responsible for the collective dynamics of SAHH: 

Normal mode analysis (NMA) based on the elastic network model (ENM) is used to grasp 
the caricature of SAHH dynamics evolving from the native topology. Along with the previous 
studies, |ini HZ] the covariance matrices calculated for the open and closed SAHH provide a 
general view of how the enzyme fluctuations in each form (see Figure 3A). The catalytic domain is 
more mobile than the cofactor-binding domain, and the overall flexibility decreases in the closed 
conformation. They also indicate that the cofactor-binding domain and the C-terminal domain 
of partner sub unit are cross-correlated (see black arrows), in consistent with the structural 
organization of SAHH dimer where the C-terminal domains are exchanged between the adjacent 
subunits. Interestingly, the catalytic domain of A (or B) subunit is cross-correlated with that 
of D (or C) subunit (see red arrows). The largest overlap of normal mode with the O— t-C 
conformational change {ro-^c) (see Methods) is found with the lowest frequency normal mode 
(mode 7) in both the open and closed forms (Figures 3B left and 3C left). The closure motion 
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of catalytic domain relative to the cofactor-binding domain and the relative rotation between 
AB and CD dimers are the main fluctuations characterizing the global dynamics of open and 
closed forms of SAHH. 

To determine "hot spot" residues controlling the O— t-C conformational dynamics, we 
employed the structural perturbation method (SPM) [23], which complements the results of 
SCA. The SPM mimics the point mutation of an enzyme by increasing the spring constant of 
a residue in the elastic network. Perturbation to a residue important for functional motion 
leads to a large change in the normal mode frequency (high-^o;, see Methods). For the closure 
dynamics described by mode 7 of the open SAHH structure, key residues with high-^o; are 
mainly distributed in the hinge region (FigjsjB right). Notably, this flnding conforms to the 
recent time-resolved fluorescence anisotropy measurements that probed the domain motions 
of mutant SAHH, where importance of residues M351, H353, P354 at the hinge region was 
highlighted. [18] On the other hand, for the dimer-dimer rotation dynamics described by the 
mode 7 of the closed structure, high-^o; residues are found at the interfaces between subunits 
(Fig|3p right). Key residues controlling the next higher frequency modes are distributed mainly 
at the interfaces between the subunits, contributing to the concerted intersubunit dynamics 
(see Figure S3 for mode 9). 

The global motion of SAHH during the O— t-C transitions upon ligand binding: 

The conformational dynamics and its dynamical response to ligand binding are prerequisite 
to understanding the enzyme function. To this end, we performed Brownian dynamics (BD) 
simulations using a coarse-grained model of the whole tetrameric structures in the holo and 
ligand-bound states (see Methods). Each trajectory consists of 750 fis simulations; the flrst 250 
IIS simulates the equilibrium stage of SAHH in the absence of ligand, and the next 500 /is is for 
the O— i-C transition dynamics in response to the ligand binding. We probed the ligand binding 
by using the distance between the fluorine atom of F-NpcA and C4 atom of the nicotinamide 
ring of the cofactor NADH (Figure S4). Among the total 60 simulated trajectories, we observed 
complete bindings of four ligands in 39 trajectories during our simulation time. A comparison 
of the fraction of native contact map (Qij) before and after the ligand binding revealed that 
Qij increases and decreases depending on residue pairs (Fig|4j\), suggesting that the structural 
reorganization due to O — t- C transition involves formation and breakage of multiple residue 
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pairs (Fig|4jA., 4B, and 4C). The Qij between the /35-a6 loop and aY hehx (marked with blue 
spheres in Fig|4|3), located at the entrance of the binding cleft, dramatically increases after the 
ligand binding, while the Qij between aAl, al7 and alb (marked with red spheres in Fig|4j3) 
decreases. In case of the inter-subunit contacts, key increases of residue contacts are made 
between the hinge region and aC helix of the partner subunit (Figjlp). 

Prior to the ligand binding, the enzyme structure remains largely in the open state as 
expected. Nevertheless, the fluctuation of the RMSD relative to the closed structure (Ac) 
occasionally brings SAHH structure to Ac < 2.5 A even without ligand in the binding cleft 
(Figjr^). As a result of the ligand binding, the Ac decays from 3.7 A to 2.9 A in 34 fis (Fig|4|3 
left), the angle a decreases from 69° to 53° (FigjljD middle), and the angle /3 changes from 5° 
to ^ 12° (Fig lib right). 



Kinetic hierarchy of the O— j-C transition: The inter-residue distance between i-th and j'-th 
residues, {dij(t)), where (...) denotes an ensemble average over trajectories, probes microscopic 
details of O— J-C transitions described above. Histograms of dij, P{dij), before and after the ligand 
binding quantify the change in average and fluctuation of inter-residue distance (Figjij). The 
average transition time for each residue pair, obtained from multi (mostly single) exponential 
fit to the {dij{t)), reveals the order of events that occur during the O— i-C transition dynamics 
upon ligand binding (Fig|6]). 

Closure dynamics of the binding cleft The structural motifs in the binding cleft, composed 
of several loops and an alpha helix, directly interact with a ligand that mediates the association 
between the catalytic and cofactor-binding domains (FigjljA.). We employed several residue pairs 
in these loops as reporters of intra-subunit closure dynamics. The dynamics of residue pairs 
associated with G277, H301, A340 and R343 from three representative loops consisting of binding 
cleft show single exponential decays with time constants of r = 35 — 38 fis (FigjsjA. green panel). 
The ligand binding reduces not only the average distances between the catalytic and cofactor- 
binding domains but also their fluctuations (see the inset of green panel in FigjsjA. for D131- 
H301 pair). These results are in good agreement with the experimental data of fluorescence 
anisotropy measurements, [49] conflrming that the intra-subunit closure dynamics within SAHH 
are substantially reduced after the ligand binding. Among the residue pairs we probed, H301 
is known to form key hydrogen bonding with the ligand [H |50] . Interestingly, some of residue 
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pairs associated with L347, G348 and M351, located at the entrance of binding cleft, have to 
expand during the ligand binding to accommodate the ligand to the active site (FigjsjA. blue 
panel), reminiscent of partial unfolding and refolding observed in protein-protein or protein- 
ligand bindings [121 EB ES] • 

Inter-subunit dynamics probed with the residue pairs at the subunit interface: The dynamics 
of inter-subunit residue pairs are responsible for the concerted motion of tetrameric SAHH. 
The inter-subunit residue pairs, including K166-G418, F189-Q251 at AB (or CD) interface, and 
W17-I321, W17-K322, K20- 1321 at AC (or BD) interface, exhibit a single exponential decay 
slower than that of the intra-subunit residue pairs (the magenta and cyan panels of Figj5|\). 
It is of particular note that many of individual time traces of inter-subunit residue pairs show 
bimodal hopping transition (FigjsjB); the histogram of (iAc(BD):Wi7-i32i can be described using 
a double gaussian function, centered at 8 A and 14 A in the absence of ligand. Upon ligand 
binding the population shifts toward c?ac(bd):W17-I32i ~ 8 A, reminiscent of population shift 
mechanism. 

Two step mechanism of open-to- closed transition dynamics: Comparison of transition times 
(r) of various residue pairs revealed an order of events that occur during the O— t-C dynamics. 
The hierarchical order of r (Figjsj) suggests that transition dynamics progresses from intra- 
subunit closure dynamics to the inter-subunit contact formations. Upon the ligand binding, 
(i) the intra-residue pairs in the cleft regions participating in the closure dynamics occurs with 
time scales of r 33 — 38 /xs. Many of intra-residue pairs, located in particular at the entrance 
of the binding cleft (blue spheres in FigjsjA.) show nonmonotonic kinetics with transition times 
r ~ 33 — 36 /is, followed by a angle transition in 37 /is. (ii) The inter-subunit residue pairs 
(cyan and magenta spheres in Figj5|\) and the intra-residue pairs at the central region of binding 
cleft (green spheres) decrease with decay times of r ^ 35 — 38 ^s. Of particular note is that 
the contact formation of residue pairs at the AC (or BD) interface precedes the full ordering 
of the binding cleft. Finally, the dimer-dimer rotation between AB and CD (/3 ^ 4° — 11°) is 
completed in 43 /is. 

To summarize (see Figures 5 and 6), in response to the ligand binding, the closure dynamics 
of the binding cleft due to the formation of intra-subunit contacts (blue), probed with (a), 
precedes both the inter-subunit residue contacts between AC or BD interface (cyan) and the 
more precise positioning of the active site (green). The rearrangement of interfacial contacts 
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(magenta) between A and B (or C and D) subunits and dimer-dimer rotation, measured 
with occurs at the stage later than the closure dynamics measured with a. From local 
compaction at the binding cleft to the global compaction of entire structure exists a hierarchical 
order in O — )■ C transition dynamics. 

Transition dynamics of SAHH in response to the ligand binding occurs via both 
modulation of energy landscape and selection of conformational ensemble: Two lim- 
iting mechanisms, induced fit and population shift (or conformational selection) (Figj7j\), are 
often contrasted in literatures to account for the allosteric dynamics associated with ligand 
binding or protein-protein association [53l [M] . In the SAHH dynamics, it is clear that the 
interaction between the ligand and SAHH (^ifSAHH-hg^ jg main cause of the O — )■ C tran- 
sition. However, it is not clear whether the ligand binding selects a small subpopulation of 
the closed structure from a pre-existing ensemble or modulates the folding landscapes. From 
the perspective of statistical thermodynamics, this question may be answered by projecting the 
population of accessible conformational states on a map in reduced dimension and comparing 
the two populations obtained from the holo and ligand-bound conditions. If the overlap between 
the statistical ensembles corresponding to the closed structure under the energy hamiltonians 
of the holo and ligand-bound states is substantial, then population shift is more plausible. In 
contrast, if there is little overlap between the statistical ensembles, then induced fit mechanism 
is favored. 

Because of the attractive contact pairs incorporated from the closed structure (see Methods), 
SAHH structure in the open form can transiently visit compact (or closed-like) structures even 
in the absence of ligand (Fig|7|3) We assessed the similarity between the ensemble of 

closed-like structures (CL) that satisfies Ac < -Rc, where i?c = 2.5 A in holo-structure and the 
ensemble of closed structures (C) that also satisfies Ac < Rc in the presence of hgand (Figures 
7C and 7D). Following the line of argument made above, if the enzyme recognizes the ligand 
solely by selecting a pre-existing conformational ensemble, a distribution of scattered plot using 
inter-residue distances ((iKio3-G346) c^Diai-Haoi) or angles (a,/3) as surrogate reaction coordinates 
will show a substantial overlap of CL ensemble with C ensemble. On the contrary, under the 
induced fit mechanism, we expect a difference between the distributions of CL and C. In Figures 
7C and 7D, C and CL structural ensembles show clear distinction, but with a certain degree of 
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overlap, giving evidence for both induced fit and population shift mechanisms. 

Depending on the molecule being studied, the concentration of ligand, and the position of 
probe being attached, one of the two mechanisms becomes more dominant [53]. For SAHH, 
the feature of conformational selection to the closed structure is observed in the residue pairs 
at the inter-subunit interface (see the dynamics of dAc(BD):wn-i32i in where one of the 
bimodal basins corresponding to the closed structure at dAc(BD):wn-i32i ~ 8 A becomes more 
populated upon the ligand binding) while the modulation of energy landscape, which alters 
the positions of energy minima, is more dominant in the dynamics of intra-subunit residue pairs. 

Characteristics of transition state ensemble of SAHH during the O— j-C transition: 

Identifying characteristics of the transition state of an enzyme along the reaction coordinate 
associated with its functional motion is an important task in designing an inhibitor drug that 
can fit to the active site. To this end, we analyzed the transition state ensembles (TSEs) of 
SAHH conformations by using Ac{t) and Ao(t) as surrogate reaction coordinates [T0l[35]. Upon 
ligand binding, {Ac(t)) increases and (Ao(t)) decreases, and they cross at a certain moment, 
i.e., {Ao{t)) = {Ac{t)) (|4|3 left). Transition states can be collected from the individual time 
traces by imposing a condition \Ao{t) — Ac{t)\ < 6 with 6 = 0.01 A for intra-subunit and 



routes are formed on a plastic energy landscape. A Tanford /3-like paramter, defined using the 
equilibrated distances of open form (xq) and closed form (xc), Pt = {xts — xc)/{xo — xc), 
evaluates the similarity of TSE to the ligand-bound state [35j]- We obtained /3r = 0.33 and 
0.44 for the dynamics of both distance and angle pairs, suggesting that the TSEs resemble the 
ligand-bound (closed) state (/3t < 0.5). 

IV. CONCLUDING REMARKS 

Allosteric regulation of protein function is encoded in the sequence space of protein family 
and in the native topology of protein architecture. Functional and energetic restraints imposed 
by the residue network give rise to the close correlation between conformational fluctuations 
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around the native state and transition dynamics in response to the hgand binding although 
the pattern and time scale associated with these dynamics differ in each molecule due to the 
stochastic nature of dynamic processes and plasticity of conformational space. The analysis of 
various inter-residue dynamics from our BD simulations revealed that upon ligand binding, the 
local signals at the active site of each subunit are transmitted progressively to the inter-subunit 
contact formation that in turn invokes the precise alignment of the catalytic elements at the 
active site, followed by the dimer-dimer rotation. Along with the experimental studies reporting 
the effect of mutation on the SAHH activity [HI HH US], our multi-faceted study of SAHH 
dynamics suggests that the enzymatic function is not simply represented by the relationship 
between the active site and ligand, but is the consequence of communication between amino 
acids nonlocally wired over the entire structure. Together with the list of evolutionarily and 
topologically important residues identified by the SCA and SPM (|2| [s] and Table 1), the 
dynamic picture of O — t- C transitions of SAHH shown in this study is amenable to further 
experimental investigations. 
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TABLE I: Evolutionarily co- varying residues identified by statistical coupling analysis (SCA). 

* residues identified in both the perturbation and position clusters. 

^ highly conserved residues. 
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FIG. 1: Structure of SAHH. A. A subunit structure consisting of the catalytic (residues 1-181 and 
355-385), cofactor-binding (residues 197-351) and C-terminal domains (residues 386-432) are shown 
on the left. The angle a is defined among W112, K186, and W310. A dimer structure assembled by 
exchanging the C-terminal domains is shown on the right with structural motifs associated with the 
dimer interactions being colored in blue and red. B. Homo-tetrameric structure. Each monomer is 
represented by blue, cyan, purple and orange surfaces. /3 is the angle between the two vectors, one 
from CMa to CMb and the other from CMc to CMd , where CMx is the center of position of a subunit 
X. Electrostatic potentials on the molecular surface calculated by solving Poisson-Boltzmann equation 
at 0.2 M salt condition show negatively charged central core channel and active site. The side view of 
the structure and the dimer-dimer interface are shown on the left and right, respectively. 
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FIG. 2: Statistical coupling analysis of SAHHs. A. AGi/kT* value of the i-th residue in SAHH 
monomer. The catalytic (cyan), hinge (magenta), cofactor-binding (dark blue), and C-terminal tail 
domains are depicted in different colors. B. Highly conserved residues with AGi > 0.75 from A are 
mapped on the monomer (left) and dimer (right) structures. C. Reordered AAGy matrix that identifies 
the two clusters (white boxes) is shown in the middle and the corresponding two co-evolving clusters 
of sequences are depicted on the SAHH structure (see SI). For cluster 1, perturbation and position 
clusters are colored in yellow and green, respectively. For cluster2, perturbation and position clusters 
are colored in cyan and red, respectively. 
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FIG. 3: Normal mode analysis and structural perturbation method applied to SAHH. A. (Left) Com- 
parison between mean square fluctuation (cyan) and crystallographic b-factor (brown) of the open form 
of rat SAHH (PDB code: 1B3R). (Right) Covariance matrices of SAHH open (left) and closed (right) 
forms. The differences between the two matrices are shown in the middle. Our calculations using ENM 
compare well with those in previous study [46] . B. Overlap between fpc and normal mode M {vm) 
of the open structure. Maximally overlapping normal mode (M = 7) are displayed using lines in the 
structures (left). The effect of structural perturbation, 5w(M, i), is calculated for ^^1=7 (middle). The 
residues with hi^-5uj value with respect to vm=7 are depicted in the structure on the right and listed 
in the Table SI. C. The same calculation for the closed structure by using I'm =7 as the maximally 
overlapping mode with fpc- 
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t (^s) t (us) t iixs) 

FIG. 4: BD simulations of the O— t-C transitions in SAHH. A. Native contact probability (Qij) changes 
as a result of the conformational change in response to the ligand binding. Among the native contact 
pairs, Qij are colored in blue (red) when Qij increases (decreases) more than 10 %. The position 
of corresponding residue pairs in a SAHH subunit and at the subunit interface are depicted in the 
structures on the right. B, C. Representative time traces of native contacts Qij{t) for the intra- (B) 
and inter-subunit (C) residue pairs. The residue pairs with SQij{t) < after ligand binding are marked 
in red, and those with 6Qij{t) > are in blue. 189-361 (red), 185-364 (magenta), 367-394 (brown), 
83-347 (blue), 83-348 (cyan), 82-343 (turquoise); AC(BD):231-235 (red), AC(BD):234-235 (magenta), 
AB(CD):185-247 (blue). D. Time-dependent changes of root mean square deviation (RMSD) relative 
to the open and closed forms ((Ao) and (Ac)), (a) and {(3) are shown from left to right. 



23 




FIG. 5: Dynamics of intra- and inter-subunit residue pairs. A. Intra- and inter-subunit dynamics 
averaged over the ensemble of trajectories probed by (dy) and the corresponding histogram, P{dij), 
before (black line) and after (colored area) the ligand binding. The intra-subunit residue pairs consisting 
of the catalytic site exhibit monotonic (green spheres) and non-monotonic decays (blue spheres). The 
inter-subunit residue pairs at AB (or CD) and AC (or BD) interfaces are displayed in purple and cyan 
spheres, respectively. The list of residue pair dynamics depicted in each panel is: 79-301, 131-301*, 
157-301, and 159-301 for green spheres; 83-347, 83-348* 83-351, and 85-351 for blue spheres; AB(CD) 
166-418* and AB(CD) 162-416 for magenta spheres; AC(BD) 17-321*, 17-322 and 20-321 for cyan 
spheres, where anterisk (*) denotes the residue pairs whose histogram is shown in the insets. B. An 
individual time trace of dAC(BD):Wi7-i32i(^) ^it the AC subunit interface displays a biomodal hopping 
transition, and its population change is shown in the upper right panel. The histograms before and 
after the ligand binding (blue and red lines, respectively) fitted to a double Gaussian function (black 
lines) show that there is a "shifting of population" from dAc(BD):Wi7-i32i ~ 14 A to 8 A. 
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FIG. 6: Kinetic hierarchy revealed in the SAHH dynamics. Average transition times for various residue 
pairs discussed in [5] and a, f3 angle dynamics are shown in an increasing order. The colors of bar graphs 
are identical to those used in the previous figures. 
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FIG. 7: Comparison between the closed-like (CL) and closed structures (C). A. A schematic diagram 
of molecular recognition processes contrasting induced fit and conformational selection mechanisms. 
B. Exemplary time traces of visiting the CL state prior to the ligand binding (t < 250 fisec, 
Ac < Rc = 2.5 A) and the C state in the presence of ligand {t > 250 fisec, Ac < Rc = 2.5 A). C. 
Scattered plot using two intra-subunit residue distances. Ensembles of CL and C states are displayed 
in black and green, respectively, and their histograms are shown in the upper and right side of the 
graphs. For comparison, the projected space sampled by the whole trajectories before and after the 
ligand binding are shown in cyan and brown. D. Scattered plot of CL and C ensembles using a and 
/3. 
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FIG. 8: Transition state ensembles of the O— t-C transitions. A. Time-dependent changes of Aq (black) 
and Ac (green), in a sample trajectory. The instances of \Ao{t) — Ac(t)| < 6, which defines the 
transition state, are marked with red arrows. B. The TSEs represented with distances of two intra- 
subunit residue pairs (left) and with a and /3 angles (right). On the two-dimensional maps representing 
TSE, the average positions of the TSE are marked with blue spheres, and the average distances and 
angles for open and closed forms are marked with the black and green circles. The calculated Tanford 
/3-like values for the two maps are 0.33 and 0.44, respectively. 
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FIG. SI: The secondary structural elements in SAH hydrolase. The alpha helices of the catalytic, 
cofactor-binding and C-terminal domains are colored by cyan, blue and orange, respectively, while 
those of the hinge region are in magenta. The beta sheets and loops are colored by light-blue. The 
nomenclatures of the structural motifs follow those in the study by Turner et al. |56js: al, 12-28; q2, 
30-39; (33, 49-53; a4, 58-69; (35, 73-77; q6, 86-95; (37, 99-101; a8, 107-118; /39, 127-130; alO, 134-142; 
all, 144-149; (312, 152-155; al3, 158-170; /314, 177-179; aAl, 184-188; qA2, 189-207; (3A, 215-219; aB, 
223-234; (3B, 238-242; aC, 246-254; (3C, 258-260; aD, 262-268; (3B, 271-274; aE, 284-289; (3E, 294-298; 
aF, 308-314; (3X, 315-322; (3Y, 325-330; (3F, 333-339; aX, 340-342; aY, 345-349; al5, 355-374; ^16, 
383-385; al7, 388-402; al8, 411-417. 
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FIG. S2: Results of statistical coupling analysis. A. Distribution of average free energy for subalign- 
ment, (Gf), for various sizes {Ns = / x Nmsa)- B. Among SCA-identified hot-spot residues, highly 
conserved residues that satisfies AGi > 0.75 are mainly found in the C-terminal domain and at C79. 
Green and red surfaces represent C79, and residues in C-terminal domain, respectively. Black spheres 
at each subunit represent the cofactor and ligand. 



30 




FIG. S3: Results from structure perturbation method. 5ijJ values with respect to the next highly 
overlapping mode {um=9) with tq^c for the open (orange) and closed (skyblue) forms (top). The 
i)M=9 for the open and closed forms are depicted on the structure with red and blue lines, respectively, 
and listed in the Table S2. 
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FIG. S4: Ligand binding probed using the distance between the ligand and cofactor. 
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FIG. Table SI: The list of hot spot residues identified with SPM for the mode 7 in open and closed 
forms. 
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FIG. Table S2: The list of hot spot residues identified with SPM for the mode 7 in open and closed 
forms. 
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FIG. Movie SI: A simulation of ligand-binding induced conformational change of SAHH. 
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FIG. Movie S2: A simulation of ligand-binding induced conformational change of SAHH, a cleft view 
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